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Joint Source-Channel Codes for MIMO Block 

Fading Channels 

Deniz Giindiiz, Elza Erkip 
Abstract 

We consider transmission of a continuous amplitude source over an L-block Rayleigh fading Mt x Mr MIMO 
channel when the channel state information is only available at the receiver. Since the channel is not ergodic. Shannon's 
source-channel separation theorem becomes obsolete and the optimal performance requires a joint source -channel 
approach. Our goal is to minimize the expected end-to-end distortion, particularly in the high SNR regime. The figure 
of merit is the distortion exponent, defined as the exponential decay rate of the expected distortion with increasing 
SNR. We provide an upper bound and lower bounds for the distortion exponent with respect to the bandwidth ratio 
among the channel and source bandwidths. For the lower bounds, we analyze three different strategies based on 
layered source coding concatenated with progressive, superposition or hybrid digital/analog transmission. In each 
case, by adjusting the system parameters we optimize the distortion exponent as a function of the bandwidth ratio. 
We prove that the distortion exponent upper bound can be achieved when the channel has only one degree of freedom, 
that is L = 1, and min{Mt, Mr} — 1. When we have more degrees of freedom, our achievable distortion exponents 
meet the upper bound for only certain ranges of the bandwidth ratio. We demonstrate that our results, which were 
derived for a complex Gaussian source, can be extended to more general source distributions as well. 

Index Terms 

Broadcast codes, distortion exponent, diversity-multiplexing gain tradeoff, hybrid digital/analog coding, joint 
source-channel coding, multiple input- multiple output (MIMO), successive refinement. 

I. Introduction 

Recent advances in mobile computing and hardware technology enable transmission of rich multimedia 
contents over wireless networks. Examples include digital TV, voice and video transmission over cellular 
and wireless LAN networks, and sensor networks. With the high demand for such services, it becomes 
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crucial to identify the system limitations, define the appropriate performance metrics, and to design 
wireless systems that are capable of achieving the best performance by overcoming the challenges posed 
by the system requirements and the wireless environment. In general, multimedia wireless communica- 
tion requires transmitting analog sources over fading channels while satisfying the end-to-end average 
distortion and delay requirements of the application within the power limitations of the mobile terminal. 

Multiple antennas at the transceivers have been proposed as a viable tool that can remarkably improve 
the performance of multimedia transmission over wireless channels. The additional degrees of freedom 
provided by multiple input multiple output (MIMO) system can be utilized in the form of spatial 
multiplexing gain and/or spatial diversity gain, that is, either to transmit more information or to increase 
the reliability of the transmission. The tradeoff between these two gains is explicitly characterized as the 
diversity-multiplexing gain tradeoff (DMT) in [1]. How to translate this tradeoff into an improved overall 
system performance depends on the application requirements and limitations. 

In this paper, we consider transmission of a continuous amplitude source over a MIMO block Rayleigh 
fading channel. We are interested in minimizing the end-to-end average distortion of the source. We 
assume that the instantaneous channel state information is only available at the receiver (CSIR). We 
consider the case where K source samples are to be transmitted over L fading blocks spanning A'^ 
channel uses. We define the bandwidth ratio of the system as 



and analyze the system performance with respect to b. We assume that K is large enough to achieve 
the rate-distortion performance of the underlying source, and N is large enough to design codes that can 
achieve all rates below the instantaneous capacity of the block fading channel. 

We are particularly interested in the high SNR behavior of the expected distortion (ED) which is 
characterized by the distortion exponent [2]: 



Shannon's fundamental source-channel separation theorem does not apply to our scenario as the channel 
is no more ergodic. Thus, the optimal strategy requires a joint source-channel coding approach. The 
minimum expected end-to-end distortion depends on the source characteristics, the distortion metric, the 
power constraint of the transmitter, the joint compression, channel coding and transmission techniques 
used. 

Since we are interested in the average distortion of the system, this requires a strategy that performs 
'weir over a range of channel conditions. Our approach is to first compress the source into multiple layers, 
where each layer successively refines the previous layers, and then transmit these layers at varying rates. 




channel uses per source sample, 



(1) 



A = 



log ED 
lim 

5A'i?^oo log SNR 



(2) 
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hence providing unequal error protection so that the reconstructed signal quality can be adjusted to the 
instantaneous fading state without the availability of the channel state information at the transmitter 
(CSIT). We consider transmitting the source layers either progressively in time, layered source coding 
with progressive transmission (LS), or simultaneously by superposition, broadcast strategy with layered 
source (BS). We also discuss a hybrid digital-analog extension of LS called hybrid-LS (HLS). 

The characterization of distortion exponent for fading channels has recently been investigated in several 
papers. Distortion exponent is first defined in [2], and simple transmission schemes over two parallel 
fading channels are compared in terms of distortion exponent. Our prior work includes maximizing 
distortion exponent using layered source transmission for cooperative relay [3], [4], [5], for SISO [6], 
for MIMO [7], and for parallel channels [8]. HolUday and Goldsmith [9] analyze high SNR behavior of 
the expected distortion for single layer transmission over MIMO without explicitly giving the achieved 
distortion exponent. Hybrid digital-analog transmission, first proposed in [10] for the Gaussian broadcast 
channel, is considered in terms of distortion exponent for MIMO channel in [11]. 

Others have focused on minimizing the end-to-end source distortion for general SNR values [15], [16]. 
Recently, the LS and BS strategies introduced here have been analyzed for finite SNR and finite number 
of source layers in [17] -[22]. 

This paper derives explicit expressions for the achievable distortion exponent of LS, HLS and BS 
strategies, and compares the achievable exponents with an upper bound derived by assuming perfect 
channel state information at the transmitter. Our results reveal the following: 

• LS strategy, which can easily be implemented by concatenating a layered source coder with a 
MIMO channel encoder that time-shares among different code rates, improves the distortion exponent 
compared to the single-layer approach of [9] even with limited number of source layers. However, 
the distortion exponent of LS still falls short of the upper bound. 

• While the hybrid digital-analog scheme meets the distortion exponent upper bound for small band- 
width ratios as shown in [1 1], the improvement of hybrid extension of LS (HLS) over pure progressive 
layered digital transmission (LS) becomes insignificant as the bandwidth ratio or the number of digital 
layers increases. 

• Transmitting layers simultaneously as suggested by BS provides the optimal distortion exponent for 
all bandwidth ratios when the system has one degree of freedom, i.e., for single block MISO/SIMO 
systems, and for high bandwidth ratios for the general MIMO system. Hence, for the mentioned 
cases the problem of characterizing the distortion exponent is solved. 

• There is a close relationship between the DMT of the underlying MIMO channel and the achievable 
distortion exponent of the proposed schemes. For LS and HLS, we are able to give an explicit 
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characterization of the achievable distortion exponent once the DMT of the system is provided. For 
BS, we enforce successive decoding at the receiver and the achievable distortion exponent closely 
relates to the 'successive decoding diversity-multiplexing tradeoff' which will be rigorously defined 
in Section IVl 

• The correspondence between source transmission to a single user with unknown noise variance 
and multicasting to users with different noise levels [10] suggests that, our analysis would also 
apply to the multicasting case where each receiver has the same number of antennas and observes 
an independent block fading Rayleigh channel possibly with a different mean. Here the goal is to 
minimize the expected distortion of each user. Alternatively, each user may have a static channel, but 
the channel gains over the users may be randomly distributed with independent Rayleigh distribution, 
where the objective is to minimize the source distortion averaged over the users. 

• While minimizing the end-to-end distortion for finite SNR is still an open problem, in the high SNR 
regime we are able to provide a complete solution in certain scenarios. Using this high SNR analysis, 
it is also possible to generalize the results to non-Gaussian source distributions. Furthermore, LS and 
BS strategies motivate source and channel coding strategies that are shown to perform very well for 
finite SNRs as well [17] -[22]. 

We use the following notation throughout the paper. E[-] is the expectation, f{x) = g{x) is the 
exponential equaUty defined as lim^^^oo = 1, while > and < are defined similarly. Vectors and 

matrices are denoted with bold characters, where matrices are in capital letters. [-J-^ and [-J^ are the 
transpose and the conjugate transpose operations, respectively. tr(A) is the the trace of matrix A. For 
two Hermitian matrices A ^ B means that A — B is positive-semidefinite. {x)'^ is a; if x > 0, and 
otherwise. We denote the set {[xi, . . . ,Xn] ■ Xi G M+, Vi} by M"+. 

II. System Model 

We consider a discrete time continuous ampUtude (analog) source {sfcl^^' Sfc G M available at the 
transmitter. For the analysis, we focus on a memoryless, i.i.d., complex Gaussian source with independent 
real and imaginary components each with variance 1/2. We use the distortion-rate function of the complex 
Gaussian source D{R) = 2-^ where R is the source coding rate in bits per source sample, and consider 
compression strategies that meet the distortion-rate bound. Although in Sections HIDlVIl we use properties 
of this complex Gaussian source (such as its distortion-rate function and successive refinability), in Section 
IVIIII we prove that our results can be extended to any complex source with finite second moment and 
finite differential entropy, with squared-error distortion metric. As stated in Section Jl we assume that K 
source samples are transmitted in N channel uses which corresponds to a bandwidth ratio oi b = N/ K. 
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In all our derivations we allow for an arbitrary bandwidth ratio b > 0. 

We assume a MIMO block fading channel with Mt transmit and Mr receive antennas. The channel 
model is 

i = l,...,N 



H[f]x[i] +z[i], 



where 



SNR 



x[i] is the transmitted signal at time i, Z = [zi, . . . , zjy] € 



(3) 



is the complex Gaussian 



noise with i.i.d entries CJ\f{0,l), and H[i] G (j^A/rxA/t jj^g channel matrix at time i, which has i.i.d. 
entries with CJ\f{0, 1). We have an L-block fading channel, that is, the channel observes L different i.i.d. 
fading realizations Hi, . . . , H/, each lasting for N/L channel uses. Thus we have 



H 













k— + l 


= H 


k— + 2 


= ... = H 




iJ 




1j 







H 



k+l, 



(4) 



for = 0, . . . , L — 1 assuming N/L is integer. The realization of the channel matrix Hj is assumed to be 
known by the receiver and unknown by the transmitter, while the transmitter knows the statistics of Hj. 



The codeword, X 



x[l], . . . ,XL 



^MtxN 



is normalized so that it satisfies tr{E[X.'^X\) < MtN. 



We assume Gaussian codebooks which can achieve the instantaneous capacity of the MIMO channel. 
We define = min{Mt,Mr} and M* = max{Mt,Mr}. 

The source is transmitted through the channel using one of the joint source-channel coding schemes dis- 



[si, . . 



to an 



cussed in this paper. In general, the source encoder matches the i^-length source vector 
to the channel input X. The decoder maps the received signal Y = y[l], . . . ,y[-/V] € C 
estimate s E of the source. Average distortion ED{SNR) is defined as the average mean squared 
error between s and s at average channel signal-to-noise ratio SNR, where the expectation is taken 
with respect to all source samples, channel realizations and the channel noise. The exact expression of 
ED{SNR) for the strategies introduced will be provided in the respective sections. 

As mentioned in Section U we are interested in the high SNR behavior of the expected distortion. We 
optimize the system performance to maximize the distortion exponent defined in Eqn. ([2]). A distortion 
exponent of A means that the expected distortion decays as SNR^^ with increasing SNR when SNR 
is high. 

In order to obtain the end-to-end distortion for our proposed strategies, we will need to characterize 
the error rate of the MIMO channel. Since we are interested in the high SNR regime, we use the outage 
probability, which has the same exponential behavior as the channel error probability [1]. For a family 

of codes with rate R = r log SNR, r is defined as the multiplexing gain of the family, and 

log PoutiSNR) 



d{r) 



lim 



(5) 



SNR^oo log SNR 

as the diversity advantage, where Pout{SNR) is the outage probability of the code. The diversity 
gain d*{r) is defined as the supremum of the diversity advantage over all possible code families with 
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multiplexing gain r. In [1], it is shown that there is a fundamental tradeoff between multiplexing 
and diversity gains, also known as the diversity-multiplexing gain tradeoff (DMT), and this tradeoff 
is explicitly characterized with the following theorem. 

Theorem 2.1: (Corollary 8, [1]) For an Mt x MIMO L-block fading channel, the optimal tradeoff 
curve d*{r) is given by the piecewise-linear function connecting the points {k, d*{k)), A; = 0, 1, ... , M*, 
where 

d*{k) = L{Mt- k){Mr - k). (6) 

III. Distortion Exponent Upper Bound 

Before we study the performance of various source-channel coding strategies, we calculate an upper 
bound for the distortion exponent of the MIMO L-block fading channel, assuming that the transmitter 
has access to perfect channel state information at the beginning of each block. Then the source-channel 
separation theorem applies for each block and transmission at the highest rate is possible with zero outage 
probability. 

Theorem 3.1: For transmission of a memoryless i.i.d. complex Gaussian source over an L-block Mt x 
Mj. MIMO channel, the distortion exponent is upper bounded by 

A^^ = L^miiK^ -,2i - 1 + |Mt - M^l I . (7) 
i=i ^ J 

Proof: Proof of the theorem can be found in Appendix U ■ 
Note that, increasing the number of antennas at either the transmitter or the receiver by one does 
not provide an increase in the distortion exponent upper bound foib < L(l + \Mt — Mj.|), since the 
performance in this region is bounded by the bandwidth ratio. Adding one antenna to both sides increases 
the upper bound for all bandwidth ratios, while the increase is more pronounced for higher bandwidth 
ratios. The distortion exponent is bounded by the highest diversity gain LMtM^. 

In the case of M x 1 MISO system, and alternatively 1 x M SIMO system, the upper bound can be 
simplified to 

A^f/50/57AfO = min{Z7,LM}. (8) 

We next discuss how a simple transmission strategy consisting of single layer digital transmission per- 
forms with respect to the upper bound. In single layer digital transmission, the source is first compressed 
at a specific rate bR, the compressed bits are channel coded at rate R, and then transmitted over the 
channel. This is the approach taken in [2] for two-parallel channels, in [3] for cooperative relay channels, 
and in [9] for the MIMO channel to transmit an analog source over a fading channel. Even though 
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Fig. 1. A geometric interpretation illustrating the optimal multiplexing gain for a single layer source-channel coding system. The intersection 
point of the DMT curve and the line y=br gives the optimal multiplexing gain- distortion exponent pair. 



compression and channel coding are done separately, the rate is a common parameter that can be chosen 
to minimize the end-to-end distortion. Note that the transmitter choses this rate R without any channel 
state information. 

The expected distortion of single layer transmission can be written as 

ED{R, SNR) = (1 - Pout{R, SNR))D{bR) + SNR), (9) 

where Pout{R, SNR) is the outage probability at rate R for given SNR, and D{R) is the distortion-rate 
function of the source. Here we assume that, in case of an outage, the decoder simply outputs the mean 
of the source leading to the highest possible distortion of 1 due to the unit variance assumption. At fixed 
SNR, there is a tradeoff between reliable transmission over the channel (through the outage probability), 
and increased fidelity in source reconstruction (through the distortion-rate function). This suggests that 
there is an optimal transmission rate that achieves the optimal average distortion. For any given SNR this 
optimal R can be found using the exact expressions for PoutiR, SNR) and D{R). 

In order to study the distortion exponent, we concentrate on the high SNR approximation of Eqn. Q. 
To achieve a vanishing expected distortion in the high SNR regime we need to increase R with SNR. 
Scaling R faster than O {log SNR) would result in an outage probability of 1, since the instantaneous 
channel capacity of the MIMO system scales as log SNR. Thus we assign R = r log SNR, where 
< r < M^,. Then the high SNR approximation of Eqn. ^ is 

ED = D{bR) + PoutiR), 

= SNR'^'' + SNR-'^'^''\ (10) 
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1 2 3 4 5 

Bandwidth ratio (b) 



Fig. 2. Upper bound and single layer achievable distortion exponents for 2x2 and 4x1 MIMO systems. 

Of the two terms, the one with the highest SNR exponent would be dominant in the high SNR regime. 
Maximum distortion exponent is achieved when both terms have the same SNR exponent. Then the 
optimal multiplexing gain r* satisfies 

A = br* = d*{r*), (11) 

where A is the corresponding distortion exponent. Eqn. (fTTI ) suggests an optimal operating point on the 
DMT curve to maximize the distortion exponent of the single layer scheme. 

Figure[T]shows a geometric illustration of the optimal multiplexing gain and the corresponding distortion 
exponent. A similar approach was taken in [9] for single layer transmission with the restriction of integer 
multiplexing gains, and later extended to all multiplexing gains in [11]. However, as we argue next, 
even when all multiplexing gains are considered, this single layer approach is far from exploiting all the 
resources provided by the system. 

In Figure [2j we illustrate the distortion exponent upper bound and the distortion exponent of the single 
layer scheme for 4 x 1 MISO and 2x2 MIMO systems. We observe a significant gap between the upper 
bounds and the single layer distortion exponents in both cases for all bandwidth ratios. This gap gets 
larger with increasing degrees of freedom and increasing bandwidth ratio. 

The major drawback of the single layer digital scheme is that it suffers from the threshold effect, i.e., 
error probability is bounded away from zero or an outage occurs when the channel quality is worse 
than a certain threshold, which is determined by the attempted rate. Furthermore, single layer digital 
transmission cannot utilize the increase in the channel quality beyond this threshold. Lack of CSIT 
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makes only a statistical optimization of the compression/transmission rate possible. To make the system 
less sensitive to the variations in the channel quality, we will concentrate on layered source coding 
where the channel codewords corresponding to different layers are assigned different rates. Using the 
successive retinability of the source, we transmit more important compressed bits with higher reliability. 
The additional refinement bits are received when the channel quality is high. This provides adaptation to 
the channel quality without the transmitter actually knowing the instantaneous fading levels. We argue 
that, due to the exponential decay of the distortion-rate function in general, layering increases the overall 
system performance from the distortion exponent perspective. Our analysis in the following sections 
proves this claim. 

IV. Layered source coding with progressive transmission and Hybrid Digital- Analog 

EXTENSION 

The first source-channel coding scheme we consider is based on compression of the source in layers, 
where each layer is a refinement of the previous ones, and transmission of these layers successively in 
time using channel codes of different rates. We call this scheme layered source coding with progressive 
transmission (LS). This classical idea, mostly referred as progressive coding, has been used to various 
extents in the image and video standards such as JPEG2000 and MPEG-4. After analyzing the distortion 
exponent of LS in Section IIV-AI in Section IIV-BI we consider a hybrid digital-analog extension called 
hybrid LS (HLS) where the error signal is transmitted without coding. In this section we analyze single 
block fading, i.e., L = 1, for clarity of presentation. Generalization to the multiple block case {L > 1) 
will be a straightforward extension of the techniques presented here and will be briefly discussed in 
Section |Vll 

A. Layered Source Coding with Progressive Transmission (LS) 

We assume that the source encoder has n layers with each layer transmitted over the channel at rate 
Ri bits per channel use in tiN channel uses for i = 1, 2, . . . , n, with Y17=i ~ ^- "^^^^ illustrated in 
Fig. [Ha). We assume that tiN is large enough to approach the instantaneous channel capacity. For each 
layer this corresponds to a source coding rate of btiRi bits per sample, where b is the bandwidth ratio 
defined in ([T|). The ith layer is composed of the successive refinement bits for the previous i — I layers. 
The transmission power is kept constant for each layer, so the optimization variables are the rate vector 
R = [Ri, . . . , Rn] and the channel allocation vector t = [ti, . . . , 

Let Pg^j denote the outage probability of layer i, i.e., = Pr{C(H) < Ri}. Using successive 
refinability of the complex Gaussian source [23], the distortion achieved when the first i layers are 
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M N channel uses ► 



Ri 
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• • • 


Rn 


tiN t2N 
(a) LS strategy with n 


layers. 
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• • • 
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Analog transmission 


(b) HLS strategy with 


n layers for b > 






Ri,SNRi 


• 
• 
• 


Rn, SNRn 



(c) BS strategy with n layers. Layers are transmitted simultaneously 
with total power allocated among them. 



Fig. 3. Channel and power allocation for different transmission strategies explored in the paper 



successfully decoded is 

with Dq^ = 1. Note that due to successive refinement source coding, a layer is useless unless all the 

preceding layers are received successfully. This imposes a non-decreasing rate allocation among the 
layers, i.e., Ri < Rj for any j > i. Then the expected distortion (ED) for such a rate allocation can be 
written as 

n 

ED{K, t, SNR) = ^ i^f ^ • (P^+i - P^^) , (13) 



i=0 



where we define P°„j = and PJ^J^^ = 1. 
The minimization problem to be solved is 



min ED(R,t,SNR) 

s.t. E7=iti = i, 

(14) 

ti>0, for i = 1, . . . ,n 

0<Rl<R2<-<Rn- 



This is a non-linear optimization problem which can be untractable for a given SNR. An algorithm solving 
the above optimization problem for finite SNR is proposed in [17]. However when we focus on the high 
SNR regime and compute the distortion exponent A, we will be able to obtain explicit expressions. 

In order to have a vanishing expected distortion in Eqn. (fT3] ) with increasing SNR, we need to 
increase the transmission rates of all the layers with SNR as argued in the single layer case. We let 
the multiplexing gain vector be r = [n, . . . , r„]^, hence R = r log SNR. The ordering of rates is 
translated into multiplexing gains as < ri < • • • < r„. Using the DMT of the MIMO system under 
consideration and the distortion-rate function of the complex Gaussian source, we get 

n 

ED{'R,SNR) = ^ [5A^i?-'='*(''''+^) - 5iVi?-'^'(''^-) SNR-''^'=^^^'"^ 



k=0 

Un 

k=0 



k=0 



where d*{rn+i) = 0, and the last exponential equality arises because the summation will be dominated 
by the slowest decay in high SNR regime. Then the optimal LS distortion exponent can be written as 

A^^ = max min |(i*(rfc+i) +Z7 VtirA (16) 

r,t 0<fc<n ^ ^ ' 



i=l 



n 



S.t. 



i=l 

> 0, for i = 1, . . . , n 
< n < r2 < • • • < r„ < M*. 

Assuming a given channel allocation among n layers, i.e., t is given, the Karush-Kuhn-Tucker (KKT) 
conditions for the optimization problem of ([T6l) lead to: 

UnVn = d*{rn), (17) 

d*{rn)+htn-irn-i = d*{rn-i), (18) 



d*{r2) + bhn = d*{n), (19) 

where the corresponding distortion exponent is A^"^ = d*{ri). 

The equations in (fT7l)-(fT9l) can be graphically illustrated on the DMT curve as shown in Fig. IH This 
illustration suggests that, for given channel allocation, finding the distortion exponent in n-layer LS can 
be formulated geometrically: We have n straight lines each with slope bti for i = I, ... ,n, and each line 
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d*{r) 
A 




Fig. 4. Rate allocation for the source layers of LS illustrated on DMT curve of the MIMO channel. 

intersects the y-axis at a point with the same ordinate as the intersection of the previous line with the 
DMT curve. Although the total slope is always equal to b, the more layers we have, the higher we can 
climb on the tradeoff curve and obtain a larger A. 

The distortion exponent of LS in the limit of infinite layers provides a benchmark for the performance 
of LS in general. The following lemma will be used to characterize the optimal LS distortion exponent 
in the limit of infinite layers. 

Lemma 4.1: In the limit of infinite layers, i.e., as n — > oo, the optimal distortion exponent for LS can 
be achieved by allocating the channel equally among the layers. 

Proof: Proof of the lemma can be found in Appendix |lll ■ 

The next theorem provides an explicit characterization of the asymptotic optimal LS distortion exponent 
A^'^ (in the case of infinite layers) for an Mt x Mr MIMO system. 

Theorem 4.2: Let the sequence {cj} be defined as cq = 0, Cj = Cj_i + (|Mr — M(|+2i — 1) In ^ ^"1"^^ ^ 
for i = 1, . . . , M* — 1 and cm, = co- The optimal distortion exponent of infinite layer LS is given by: 
A^^ = E?=i {\Mr-Mt\ + 2i-l) 

+(M* -p+l){\Mr - Mt\ + 2p- 1)(1 - e~ i^'''-"ti+2p-i)^ 
for Cp-i < b < Cp, p = 1, . . . , M*. 
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Proof: Proof of the theorem can be found in Appendix Jill ■ 
Corollary 4.3: For a MISO/SIMO system, we have 

^Miso/siMO = - e-'""'). (20) 
Illustration of A^'^ for some specific examples as well as comparison with the upper bound and other 
strategies is left to Section IVIII However, we note here that, although LS improves significantly compared 
to the single layer scheme, it still falls short of the upper bound. Nevertheless, the advantage of LS is the 
simple nature of its transceivers. We only need layered source coding and rate adaptation among layers 
while power adaptation is not required. 

Another important observation is that, the geometrical model provided in Fig. |4] and Theorem 14.21 
easily extends to any other system utilizing LS once the DMT is given. This is done in [4], [5] for a 
cooperative system, and will be earned out to extend the results to multiple block fading (L > 1) and 
parallel channels in Section IVTl 

B. Hybrid Digital-Analog Transmission with Layered Source (HLS) 

In [11], the hybrid digital-analog technique proposed in [10] is analyzed in terms of the distortion 
exponent, and is shown to be optimal for bandwidth ratios b < 1/M*. For higher bandwidth ratios, 
while the proposed hybrid strategy improves the distortion exponent compared to single layer digital 
transmission, its performance falls short of the upper bound. Here, we show that, combining the analog 
transmission with LS further improves the distortion exponent for b > 1/M*. We call this technique 
hybrid digital-analog transmission with layered source (HLS). We will show that, introduction of the 
analog transmission will improve the distortion exponent compared to LS with the same number of digital 
layers, however; the improvement becomes insignificant as the number of layers increases. 

For b > 1/M*, we divide the N channel uses into two portions. In the first portion which is composed 
of — K/M^ channel uses, n source layers are channel coded and transmitted progressively in time in 
the same manner as LS. The remaining K/M^, channel uses are reserved to transmit the error signal in 
an analog fashion described below. Channel allocation for HLS is illustrated in Fig. |3lb). 

Let s G be the reconstruction of the source s upon successful reception of all the digital layers. 
We denote the reconstruction error as e G C^^ where e = s — s. This error is mapped to the transmit 
antennas where each component of the error vector is transmitted without coding in an analog fashion, 
just by scaling within the power constraint. Since rank(H) < M*, degrees of freedom of the channel is 
at most at each channel use. Hence, at each channel use we utilize M^: of the Mt transmit antennas 
and in K/M^ channel uses we transmit all K components of the error vector e. HLS encoder is shown 
in Fig. m 
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Fig. 5. Encoder for the n layer HLS for b > l/A/*. 



Receiver first tries to decode all the digitally transmitted layers as in LS, and in case of successful 
reception of all the layers, it forms the estimate s + e, where e is the linear MMSE estimate of e. This 
analog portion is ignored unless all digitally transmitted layers are successfully decoded at the destination. 

The expected distortion for n-layer HLS can be written as 



EDin, SNR) 



n-l 
i=0 



P. 



out 



Pout) 



where P^^t 



and 



0, = 1> Dl^"-' = 0, 



'0 



Di{b 



fe=l / 



for i = 1, . . . , n 



r,HLS 



X/ 1 I SNR 



(21) 



(22) 



(23) 



i=l ^ ' M, 

where A denotes the set of channel states at which the n'th layer is in outage, A = [Ai, . . . , A^fJ are 
the eigenvalues of H^H, H is the x constrained channel matrix, and A = [Ai, . . . , Aa/J are the 
eigenvalues of H^H. 

Note that the expected distortion of HLS in Eqn. (|2T1) contains two terms. The first term which consists 
of the finite sum can be obtained similar to LS by using appropriate source coding rates. The following 
lemma will be used to characterize the high SNR behavior of the second term. 

Lemma 4.4: Suppose that the transmission rate of the n-th layer for HLS is i? = r log SNR, where 
r < is the multiplexing gain and that A denotes the outage event for this layer. If the average 
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signal-to-noise ratio for the analog part is SNR, we have 

i M ^TT^^^(^)^^ ^ ^^^^ 

y_4c ivi* i -t- -jj^Ai 

Proof: Proof of the lemma can be found in Appendix |lVl ■ 
Then the second term of the expected distortion in (|2TI) can be shown to be exponentially less than or 
equal to 

5iVi?"N''"^^^^=^*^]. (25) 

Note that at high SNR, both LS and HLS have very similar ED expressions. Assuming the worst SNR 
exponent for the second term, the high SNR approximation of Eqn. (|2TI ) can be written as in Eqn. ( fTSl ). 
except for the following: i) the bandwidth ratio b in ([TS] ) is replaced by 6 — -p-, and ii) the n-th term 
in ( fTSl ) is replaced by ( [25] ). Hence, for a given time allocation vector t, we obtain the following set of 
equations for the optimal multiplexing gain allocation: 

l+{b-^^tnrn = d*{rn), (26) 
d*{r2)+{b-^]tin = d*{n), (28) 



d*{rn) + {b- ) t„_ir„_i = d*(r„_i), (27) 



M, 

where the corresponding distortion exponent is again A^^^ = d*{ri). Similar to LS, this formulation 
enables us to obtain an explicit formulation of the distortion exponent of infinite layer HLS using the 
DMT curve. For brevity we omit the general MIMO HLS distortion exponent and only give the expression 
for 2 X 2 MIMO and general MISO/SIMO systems for comparison. 

Corollary 4.5: For 2x2 MIMO, HLS distortion exponent with infinite layers for 6 > 1/2 is given by 

A^^^ = l + 3[l-e-i(^-i)]. 

Corollary 4.6: For a MISO/SIMO system utilizing HLS, we have (for b > 1) 

KYso/siMO = M*- {M* - l)e-(''-i)/*^* . (29) 
Proof: For MISO/SIMO, we have = 1. For n layer HLS with equal time allocation, using 
Lemma 13.11 in Appendix JII] we obtain the distortion exponent 



1 



Kifso/siMO,n = M* -{M* \ . (30) 

V -"^ + riM' ) 

Since equal channel allocation in the limit of infinite layers is optimal, taking the limit as n — > cxd, we 
obtain (|29l). ■ 
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Comparing ^f/jso/siMO "^^^^ '-^aiiso/simo Corollary 14.31 we observe that 

^MISO/SIMO ^MISO/SIMO — L^*^ W^'-' '-J'^ J- K^'^) 

For a given MISO/SIMO system with a fixed number of M* antennas, the improvement of HLS 
over LS exponentially decays to zero as the bandwidth ratio increases. Since we have b > 1/M*, 
the biggest improvement of HLS compared to LS is achieved when b = 1/M* = 1, and is equal to 
e~^/*^*[M* — {M* — l)e^/^^']. This is a decreasing function of M*, and achieves its highest value at 
M* = 1, i.e., SISO system, at 6 = 1, and is equal to 1/e. Illustration of A^^'^ for some specific examples 
as well as a comparison with the upper bound and other strategies is left to Section IVIII 

V. Broadcast Strategy with Layered Source (BS) 

In this section we consider superimposing multiple source layers rather than sending them successively 
in time. We observe that this leads to higher distortion exponent than LS and HLS, and is in fact optimal 
in certain cases. This strategy will be called 'broadcast strategy with layered source' (BS). 

BS combines broadcasting ideas of [24] -[27] with layered source coding. Similar to LS, source 
information is sent in layers, where each layer consists of the successive refinement information for 
the previous layers. As in Section |IV] we enumerate the layers from 1 to n such that the zth layer is the 
successive refinement layer for the preceding i — l layers. The codes corresponding to different layers are 
superimposed, assigned different power levels and sent simultaneously throughout the whole transmission 
block. Compared to LS, interference among different layers is traded off for increased multiplexing gain 
for each layer. We consider successive decoding at the receiver, where the layers are decoded in order 
from 1 to n and the decoded codewords are subtracted from the received signal. 

Similar to Section |IV] we limit our analysis to single block fading (L = 1) scenario and leave the 
discussion of the multiple block case to Section |Vll We first state the general optimization problem 
for arbitrary SNR and then study the high SNR behavior. Let R = [Ri,R2, . . . ,Rn]'^ be the vector of 
channel coding rates, which corresponds to a source coding rate vector of bH as each code is spread 
over the whole N channel uses. Let SNR = [SNRi, SNR^f denote the power allocation vector 
for these layers with Yli^=i SNRi = SNR. Fig. [3lc) illustrates the channel and power allocation for BS. 
For i = 1 , . . . , n we define 

n 

'SNR^ = ^SNRj. (32) 

j=i 

The received signal over channel uses can be written as 



Y = hJ:J^X. + Z. (33) 



1 = 1 
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where Z G C^"^^ is the additive complex Gaussian noise. We assume each Xj G C^*^^ is generated 
from i.i.d. Gaussian codebooks satisfying tr(£;[XiX|]) < MtN . Here ^^^Xj carries information for 
the i-th source coding layer. For fc = 1, . . . , n, we define 



j=k 

and 



Yfe = HXfc + Z. (35) 

Note that Y^. is the remaining signal at the receiver after decoding and subtracting the first k — 1 layers. 
Denoting T(Yk] X/j) as the mutual information between and X^, we can define the following outage 
events, 

Ak = {H:2:(Yfe;Xfc)<i?fe}, (36) 

k 

Bk = [jAu (37) 

i=l 

and the corresponding outage probabilities 

= Pr{H:HeA}, (38) 
Pi, = Pr {H:HG fife}. (39) 

We note that P^, denotes the probabihty of outage for layer k given that the decoder already has access 
to the previous A; — 1 layers. On the other hand, is the overall outage probability of layer k in 
case of successive decoding, where we assume that if layer k cannot be decoded, then the receiver will 
not attempt to decode the subsequent layers i > k. Then the expected distortion for n-layer BS using 
successive decoding can be written as 

n 

ED(R,SNR) = J2^f'iP'^t-PLt), (40) 

i=0 

where 

Pout = 0' Pout^ — 1' ^'^'^ P^Q^ — 1- Various algorithms solving this optimization problem are proposed 
in [15],[18]-[22]. 

The following definition will be useful in characterizing the distortion exponent of BS. 
Definition 5.1: The 'successive decoding diversity gain' for layer k of the BS strategy is defined as 
the high SNR exponent of the outage probabihty of that layer using successive decoding at the receiver. 
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The successive decoding diversity gain can be written as 

J / \ A , . log Pout 

dsd{rk) = - lim 



SNR^oo log SNR 

Note that the successive decoding diversity gain for layer k depends on the power and multiplexing gain 
allocation for layers 1, . . . , A; — 1 as well as layer k itself. However, we drop the dependence on the 
previous layers for simplicity. 

For any communication system with DMT characterized by d*{r), the successive decoding diversity 
gain for layer k satisfies dsdir^) < d*(ri + . . . + r^). Concurrent work by Diggavi and Tse [28], coins 
the term 'successive refinability of the DMT curve' when this inequality is satisfied with equality, i.e. 
multiple layers of information simultaneously operate on the DMT curve of the system. Our work, carried 
out independently, illustrates that combining successive refinability of the source and the successive 
refinability of the DMT curve leads to an optimal distortion exponent in certain cases. 

From ( |40l ). we can write the high SNR approximation for ED as below. 

n 

ED = 



i=0 

n 



= ^SNR-'^^^'^'^'+'-^SNR-^^U^^. (41) 
Then the distortion exponent is given by 



= min I 4,(r,+i) + 6 J] r,^ • (42) 



Note that, while the DMT curve for a given system is enough to find the corresponding distortion exponent 
for LS and HLS, in the case of BS, we need the successive decoding DMT curve. 

Next, we propose a power allocation among layers for a given multiplexing gain vector. For a general 
Mt X Mj. MIMO system, we consider multiplexing gain vectors r = [ri, . . . , r„]such that ri + • • • + r„ < 
1. This constraint ensures that we obtain an increasing and nonzero sequence of outage probabilities 
{-fou*}fc=i- impose the following power allocation among the layers: 

'SNRk = 5Ari?i-('^i+-+^'=-i+^'^-i), (43) 

for A; = 2, . . . , n and < ei < • • • < e„_i. 

Our next theorem computes the successive decoding diversity gain obtained with the above power 
allocation. We will see that the proposed power allocation scheme results in successive refinement of 
the DMT curve for MISO/SIMO systems. By optimizing the multiplexing gain r we will show that the 
optimal distortion exponent for MISO/SIMO meets the distortion exponent upper bound. 
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Theorem 5.1: For Mt x MIMO, the successive decoding diversity gain for the power allocation in 
(|43] ) is given by 

dsd{rk) = M*M,(1 - n Tk-i) - (M* + - l)rfc. (44) 

Proof: Proof of the theorem can be found in Appendix |Vl ■ 
Corollary 5.2: The power allocation in (l43l) results in the successive refinement of the DMT curve for 
MISO/SIMO systems. 

Proof: For MISO/SIMO we have = 1. By Theorem O we have dsd{rk) = M*{l-ri 

rk) = d*{ri + ■ ■ ■ + rk)- Thus all simultaneously transmitted n layers operate on the DMT curve. ■ 

Using the successive decoding DMT curve of Theorem 15.11 the next theorem computes an achievable 
distortion exponent for BS by optimizing the multiplexing gain allocation among layers. 

Theorem 5.3: For Mt x Mr MIMO, n-layer BS with power allocation in (1431 ) achieves a distortion 
exponent of 

^ AMt-k){Mr-m-r^l) 

^ \Mt-k){Mr-k)-brk' 

for b G [{Mt - k - l){Mr - k - 1), {Mt - k){Mr - k)) , k = 0, . . . , M^ - 1, where 

,-(M,-t-l)W-fc-l) 

Mt + Mr-2k-l ' 



and 



ss _ n{MtMry 

nMtMr + Mt + Mr - 1'' 



.BS 



(48) 



for b > MtMr. 

In the limit of infinite layers, BS distortion exponent becomes 

b if b < MtMr, 

MtMr if b> MtMr. 
Hence, BS is distortion exponent optimal for b > MtMr. 

Proof: Proof of the theorem can be found in Appendix |Vll ■ 
Corollary 5.4: For a MISO/SIMO system, the n-layer BS distortion exponent achieved by the power 
allocation in (1431 ) is 

^MISO/SIMO,n - 1 - (6/M*)"+l J ■ 

In the limit of infinite layers, we obtain 

.BS _) ^ if ^ < 

'-^MISO/SIMO — \ (50) 

' M* if 6 > M*. 
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SISO, L = 1 
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Fig. 6. Distortion exponent vs. bandwidth ratio for SISO cliannel, L = I. 

BS with infinite source layers meets the distortion exponent upper bound of MISO/SIMO given in ^ 
for all bandwidth ratios, hence is optimal. Thus, (l50l) fully characterizes the optimal distortion exponent 
for MISO/SIMO systems. 

Recently, [12], [13] reported improved BS distortion exponents for general MIMO by a more advanced 
power allocation strategy. Also, while a successively refinable DMT would increase the distortion ex- 
ponent, we do not know whether it is essential to achieve the distortion exponent upper bound given 
in Theorem 13.11 However, successive refinement of general MIMO DMT has not been established [28], 
[29]. 

As in Section [TVl the discussion of the results is left to Section IVIII 

VI. Multiple Block Fading Channel 

In this section, we extend the results for L = 1 to multiple block fading MIMO, i.e., L > 1. As we 
observed throughout the previous sections, the distortion exponent of the system is strongly related to 
the maximum diversity gain available over the channel. In the multiple block fading scenario, channel 
experiences L independent fading realizations during N channel uses, so we have L times more diversity 
as reflected in the DMT of Theorem 12. II The distortion exponent upper bound in Theorem l3. ll promises a 
similar improvement in the distortion exponent for multiple block fading. However, we note that increasing 
L improves the upper bound only if the bandwidth ratio is greater than \Mt — Mr \ + 1, since the upper 
bound is limited by the bandwidth ratio, not the diversity at low bandwidth ratios. 

Following the discussion in Section |lVl extension of LS to multiple block fading is straightforward. 
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Fig. 7. Expected distortion vs. SNR plots for b = 2. The topmost curve LS, BS (1 layer) corresponds to single layer transmission. 



As before, each layer of the successive refinement source code should operate on a different point of 
the DMT. This requires us to transmit codewords that span all channel realizations, thus we divide each 
fading block among n layers and transmit the codeword of each layer over its L portions. 

Without going into the details of an explicit derivation of the distortion exponent for the multiple block 
fading case, as an example, we find the LS distortion exponent for 2 x 2 MIMO channel with L = 2 as 

f 4(1 -exp(-V2)) if 0<Z7<21n2, 

[ 2 + 6[l-exp(-(|-i|^))] if Z7>21n2. 
For HLS, the threshold bandwidth ratio 1/Af* is the same as single block fading as it depends on the 
channel rank per channel use, not the number of different channel realizations. For b > extension 
of HLS to multiple blocks can be done similar to LS. As an example, for 2-block Rayleigh fading 2x2 
MIMO channel, the optimal distortion exponent of HLS for ^ > 1/2 is given by 

HLS_] l + 3[l-exp(-i(Z;-i))] if l/2<Z;<i + 21nf, 



2 + 6 [l -exp i - 21n|))] if Z7>i + 21n|. 

For BS over multiple block fading MIMO, we use a generalization of the power allocation introduced 
in Section |V] in (1431 ). For L-block fading channel and for /c = 2, . . . , n let 



SNRk = 5iVi?i--^('^i+-+^^-i-^'=-i), (53) 

with < ei < • • • < en-i and imposing X^iLi ^ 1/L. Using this power allocation scheme, we obtain 
the following distortion exponent for BS over L-block MIMO channel. 



22 



Theorem 6.1: For L-block Mt x MIMO, BS with power allocation in (1531 ) achieves the following 
distortion exponent in the limit of infinite layers. 



The above generalizations to multiple block fading can be adapted to parallel channels through a scaling 
of the bandwidth ratio by L [8]. Note that, in the block fading model, each fading block lasts for N/L 
channel uses. However, for L parallel channels with independent fading, each block lasts for channel 
uses instead. Using the power allocation in (|43] ) we can get achievable BS distortion exponent for parallel 
channels. We refer the reader to [8] for details and comparison. Detailed discussion and comparison of 
the L-block LS, HLS and BS distortion exponents are left to Section IVlIl 

Distortion exponent for parallel channels has also been studied in [2] and [14] both of which consider 
source and channel coding for two parallel fading channels. The analysis of [2] is limited to single 
layer source coding and multiple description source coding. Both schemes perform worse than the upper 
bound in Theorem 13. II and the achievable strategies presented in this paper. Particularly, the best distortion 
exponent achieved in [2] is by single layer source coding and parallel channel coding which is equivalent to 
LS with one layer. In [14], although 2-layer successive refinement and hybrid digital-analog transmission 
are considered, parallel channel coding is not used, thus the achievable performance is limited. The hybrid 
scheme proposed in [14] is a repetition based scheme and cannot improve the distortion exponent beyond 
single layer LS. 



This section contains a discussion and comparison of all the schemes proposed in this paper and the 
upper bound. We first consider the special case of single-input single-output (SISO) system. For a SISO 
single block Rayleigh fading channel, the upper bound for optimal distortion exponent in Theorem 13.11 
can be written as 



This optimal distortion exponent is achieved by BS in the limit of infinite source coding layers (Corollary 
15.21 ) and by HLS [11]. In HLS, pure analog transmission is enough to reach the upper bound when b >\ 
[6], while the hybrid scheme of [11] achieves the optimal distortion exponent for b < 1. 




(54) 



This distortion exponent meets the upper bound for h > L'^MtMy. 



Proof: Proof of the theorem can be found in Appendix IVIII 



VII. Discussion of the Results 




(55) 
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Fig. 8. Distortion exponent vs. bandwidtti ratio for 4x1 MIMO, L = 1. 

The distortion exponent vs. bandwidth ratio of the various schemes for the SISO channel, L = 1 are 
plotted in Fig. [6] The figure suggests that while BS is optimal in the limit of infinite source layers, even 
with 10 layers, the performance is very close to optimal for almost all bandwidth ratios. 

For a SISO channel, when the performance measure is the expected channel rate, most of the im- 
provement provided by the broadcast strategy can be obtained with two layers [30]. However, our results 
show that when the performance measure is the expected end-to-end distortion, increasing the number 
of superimposed layers in BS further improves the performance especially for bandwidth ratios close to 
1. 

In order to illustrate how the suggested source-channel coding techniques perform for arbitrary SNR 
values for the SISO channel, in Fig. |7] we plot the expected distortion vs. SNR for single layer 
transmission (LS, BS with 1 layer), LS and BS with 2 layers, analog transmission and the upper bound 
for 6 = 2. The results are obtained from an exhaustive search over all possible rate, channel and power 
allocations. The figure illustrates that the theoretical distortion exponent values that were found as a result 
of the high SNR analysis hold, in general, even for moderate SNR values. 

In Fig. m we plot the distortion exponent versus bandwidth ratio of 4 x 1 MIMO single block 
fading channel for different source-channel strategies discussed in Section HVllVl as well as the upper 
bound. As stated in Corollary 15.21 the distortion exponent of BS coincides with the upper bound for all 
bandwidth ratios. We observe that HLS is optimal up to a bandwidth ratio of 1. This is attractive for 
practical applications since only a single coded layer is used, while BS requires many more layers to be 
superimposed. However, the performance of HLS degrades significantly beyond 6 = 1, making BS more 
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Fig. 9. Distortion exponent vs. bandwidth ratio for 2 X 2 MIMO, L = 1. 

2-block 2x2 IVIIMO channel 
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Fig. 10. Distortion exponent vs. bandwidtli ratio for 2x2 MIMO, L = 2. 



advantageous in this region. Pure analog transmission of the source samples would still be limited to a 
distortion exponent of 1 as in SISO, since linear encoding/decoding can not utilize the diversity gain of 
the system. More advanced nonlinear analog schemes which would take advantage of the diversity gain 
and achieve an improved distortion exponent may be worth exploring. While LS does not require any 
superposition or power allocation among layers, and only uses a digital encoder/decoder pair which can 
transmit at variable rates, the performance is far below BS. Nevertheless, the improvement of infinite 
layer LS compared to a single layer strategy is significant. 
We plot the distortion exponent versus bandwidth ratio for 2 x 2 MIMO with L = 1 in Fig. |9] We 
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observe that BS is optimal for & > 4 and provides the best distortion exponent for b > 2.4. For bandwidth 
ratios 1/2 < 6 < 4, none of the strategies discussed in this paper achieves the upper bound. 

Note that, both for 4 x 1 MISO and 2x2 MIMO, when b >1 /M* the gain due to the analog portion, 
i.e., gain of HLS compared to LS, is more significant for one layer and decreases as the number of layers 
goes to infinity. Furthermore, at any fixed number of layers, this gain decays to zero with increasing 
bandwidth ratio as well. We conclude that for general MEVIO systems, when the bandwidth ratio is 
high, layered digital transmission with large number of layers results in the largest improvement in the 
distortion exponent. 

In Figure [10] we plot the distortion exponent for a 2-block 2x2 MIMO channel. We observe that 
the improvement of HLS over LS, both operating with infinite number of layers, is even less significant 
than the single block case. However, HLS can still achieve the optimal distortion exponent for 6 < 1/2. 
Although BS is optimal for b > L'^MtMr = 16, both this threshold of L'^MtMr and the gap between 
the upper bound and BS performance below this threshold increases as L increases. 

VIII. Generalization to Other Sources 

Throughout this paper, we have used a complex Gaussian source for clarity of the presentation. This 
assumption enabled us to use the known distortion-rate function and to utilize the successive refinable 
nature of the complex Gaussian source. In this section we argue that our results hold for any memoryless 
source with finite differential entropy and finite second moment under squared-error distortion. 

Although it is hard to explicitly obtain the rate-distortion function of general stationary sources, lower 
and upper bounds exist. Under the mean-square error distortion criteria, the rate-distortion function R{D) 
of any stationary continuous amplitude source X is bounded as [31] 

Rl{D) < R{D) < Rg{D), (56) 

where Rl{D) is the Shannon lower bound and Rg{D) is the rate-distortion function of the complex 
Gaussian source with the same real and imaginary variances. 

Further in [32] it is shown that the Shannon lower bound is tight in the low distortion (D ^ 0), or, 
the high rate {R ^ oo) limit when the source has a finite moment and finite differential entropy. We 
have 

2h{X) 

lim D{R) 2-^ = 0. (57) 

R— >oo 27re 

The high rate approximation of the distortion-rate function can be written as D{R) = 2"^+'^(^), where 
0(1) term depends on the source distribution but otherwise independent of the compression rate. Since in 
our distortion exponent analysis we consider scaling of the transmission rate, hence the source coding rate. 
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logarithmically with increasing SNR, the high resolution approximations are valid for our investigation. 
Furthermore the 0(1) terms in the above distortion-rate functions do not change our results since we are 
only interested in the SNR exponent of the distortion-rate function. 

Although most sources are not successively refinable, it was proven in [33] that all sources are nearly 
successively refinable. Consider n layer source coding with rate of Ri bits/sample for layer i = 1, . . . , n. 
Define Di as the distortion achieved with the knowledge of first i layers and Wi = Ri — R{Di) as the 
rate loss at step i, where R{D) is the distortion-rate function of the given source. Throughout the paper 
we used the fact that this rate loss is for the Gaussian source [23]. Now we state the following result 
from [33] to argue that our high SNR results hold for sources that are nearly successively refinable as 
well. 

Lemma 8.1: (Corollary 1, [33]) For any < Dn < ■■■ < D2 < Di, (n > 2) and squared error 
distortion, there exists an achievable M-tuple {Ri, . . . , Rn) with Wk < 1/2, /c G {1, n}. 

This means that to achieve the distortion levels we used in our analysis corresponding to each successive 
refinement layer, we need to compress the source layer at a rate that is at most 1 bits/samplqj greater 
than the rates required for a successively refinable source. This translates into the distortion rate function 
as an additional 0(1) term in the exponent, which, as argued above, does not change the distortion 
exponent results obtained in this paper. These arguments together suggest that relaxing the Gaussian 
source assumption alters neither the calculations nor the results of our paper. 

IX. Conclusion 

We considered the fundamental problem of joint source-channel coding over block fading MIMO 
channels in the high SNR regime with no CSIT and perfect CSIR. Although the general problem of 
characterizing the achievable average distortion for finite SNR is still open, we showed that we can 
completely specify the high SNR behavior of the expected distortion in various settings. Defining the 
distortion exponent as the decay rate of average distortion with SNR, we provided a distortion exponent 
upper bound and three different lower bounds. 

Our results reveal that, layered source coding with unequal error protection is critical for adapting 
to the variable channel state without the availability of CSIT. For the proposed transmission schemes, 
depending on the bandwidth ratio, either progressive or simultaneous transmission of the layers perform 
better. However, for single block MISO/SIMO channels, BS outperforms all other strategies and meets 
the upper bound, that is, BS is distortion exponent optimal for MISO/SIMO. 

'We have Wf^ < 1 due to complex source assumption. 
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Appendix I 
Proof of Theorem I3.1I 

Here we find the distortion exponent upper bound under short-term power constraint assuming avail- 
ability of the channel state information at the transmitter (CSIt]^ Let C(H) denote the capacity of the 
channel with short-term power constraint when CSIT is present. Note that C(H) depends on the channel 
realizations Hi, . . . , H^. The capacity achieving input distribution at channel realization Hj is Gaussian 
with covariance matrix Qj. We have 

1 ^ / M R 

C(H) = jJ2 sup logdet I+— -H,Q,h] 

^ j=iQ.>:0,Et=it'-(Q.)<i'A^t ^ ^^^^ 

1 ^ / SNR \ 

^ Q,>:0,tr(Q,)<LA/t V V 



j=l Qj>:0,tr(Q,)<LA/, 
1 ^ 

- I X] det (I + L • SNRUj h] ) , 



(58) 



where the first inequality follows as we expand the search space, and the second inequality follows from 
the fact that LMtl — Qj ^ when tr(Qj) < LMt and logdet(-) is an increasing function on the cone 
of positive-definite Hermitian matrices. Then the end-to-end distortion can be lower bounded as 

L 

Din) = 2-^^(H) > JJ[det(I + L ■ SNRUjU])]-''/^. (59) 
i=i 

We consider expected distortion, where the expectation is taken over all channel realizations and analyze 
its high SNR exponent to find the corresponding distortion exponent. We will follow the technique used 
in [1]. Assume without loss of generality that Mt > Mr. Then from Eqn. ( |59l l we have 

L 

D{U) > Yl[det{I + L- SNRUjU])]-''/^, (60) 

L Mr 

> llllil + L-SNRXji)-'/'', (61) 
j=i i=i 

where Aji < Aji < ••• < Xjau the ordered eigenvalues of HjHt for block j = 1,...,L. Let 

Xji = 57Vi?-"-. Then we have (1 + L • SNRXji) = 57Vi?(i-°-)+. 



^We note that a similar upper bound for L = 1 is also given in [11]. We derive it for the L-block channel here for completeness. 
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The joint pdf of ocj = [aji, . . . , ajMt] for j = 1, . . . , L is 



i=l 



exp (- SNR^'A , 



.i<k 

where KMt,Mr is a normalizing constant. We can write the expected end-to-end distortion as 
E[D{U)\ = J D{H)p{oti)...p{aL)doci...daL, 

/r Mr 
+ SNR\u)-^l^p{oci)doti 



(62) 



(63) 



(64) 



where we used the fact that Qj's are i.i.d. for j = 1, . . . , L and that the constant multiplicative term in 
front of SNR does not affect the exponential behavior. Since we are interested in the distortion exponent, 
we only need to consider the exponents of SNR terms. Following the same arguments as in [1] we can 
make the following simpUfications. 



j]\{\ + SNR\u)-'''^p{cxi)doii = j fj[(l + 5iVi?i-"^-) 

Mr 



-b/L 



i=l 



i<k 



Mr Mr 



=1 



Mr 



= SNR-^' 



Again following the arguments of the proof of Theorem 4 in [1], we have 



Mr 



Ai = inf y^{2i-l + Mt-Mr)aii + -{l-au)+. 



(65) 



i=l 



The minimizing cti can be found as 



Olli 



if l<2i-l + Mt-Mr 



1 if l>2i-l + Mt- Mr. 



(66) 



Letting E[D{n)\>SNR-^"'' , we have A^^ = LAi, and 



Mr 



A 



UB 



1=1 



L ^ min <j -, 2i - 1 - |- . 



(67) 



Similar arguments can be made for the < case, completing the proof. 
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Appendix II 
Proof of Lemma|4~T] 



Let t* be the optimal channel allocation vector and r* be the optimal multiplexing gain vector for n 
layers. For any e > we can find t with ti G Q and U = ^ where \t* — ii\ < s. Let ii = ^i/ Pi 

where ^ Z,, pi ^ and 9 = LCM{pi, . . . , pn) is the least common multiple of pi,...,pn- Now 
consider the channel allocation i = [1/6, ... ,1/6]'-^, which divides the channel into 9 equal portions and 
the multiplexing gain vector 



1)---)'1)'2'' 



(68) 



oil times ets times ei„ times 
Due to the continuity of the outage probability and the distortion-rate function, this allocation which 

consists of 9n layers achieves a distortion exponent arbitrarily close to the n-layer optimal one as e ^ 0. 

Note that {A^^}'^^^ is a non-decreasing sequence since with n layers it is always possible to assign 

tn = and achieve the optimal performance of n — 1 layers. On the other hand, using Theorem 13.11 

it is easy to see that {A,^'^} is upper bounded by d*{0), hence its limit exists. We denote this limit by 

A^'^. If we define A^"^ as the distortion exponent of n-layer LS with equal channel allocation, we have 

A^'^ < A^'^. On the other hand, using the above arguments, for any n there exists m > n such that 

A^-^ > A^"^. Thus we conclude that 

lim A^^ = lim A^^^ = A^^ 

n—foo n—KX) 

Consequently, in the limit of infinite layers, it is sufficient to consider only the channel allocations that 
divide the channel equally among the layers. 

Appendix III 
Proof of TheoremI4.2I 



We will use geometric arguments to prove the theorem. Using Lemma 14.11 we assume equal channel 
allocation, that is, t = . . . , We start with the following lemma. 

Lemma 3.1: Let / be a line with the equation y = —a{x — M) for some a > and M > and let k 
for i = 1, . . . , n be the set of lines defined recursively from n to 1 as y = {h/n)x + (ij+i, where & > 0, 
dn+i = and di is the y— component of the intersection of 4 with I. Then we have 

a 



di = Ma 

with 



1 



(69) 



a + b/n ^ 

lim di = Ma(l- e"^/") . (70) 



n— >oo 
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d*{r) 
A 




-i M. - i + 1 

Fig. 11. DMT curve of an Mt X Mr MIMO system is composed of A/, = min{Aft, Mr} line segments, of which the i'th one is shown in 
the figure. 



Proof: If we solve for the intersection points sequentially we easily find 

Of a ^ 



dk - 4+1 = M- , (71) 



for = 1, . . . , n, where dn+i = 0. Summing up these terms, we get 

n— fc+l 

/ fv \ 

dk = Ma 



1 ■ " 



a + b/n 



(72) 



In the case of a DMT curve composed of a single line segment, i.e., = 1, using Lemma [3TT] we 
can find the distortion exponent in the limit of infinite layers by letting M = 1 and a = M*. However, 
for a general Mt x Mr MIMO system the tradeoff curve is composed of line segments where the 
ith segment has slope jM^ — Mt \ + 2i — 1, and abscissae of the end points — i and M^: — i + 1 as in 
Fig. [TT] In this case, we should consider climbing on each line segment separately, one after another in 
the manner described in Lemma 13.11 and illustrated in Fig. IH Then, each break point of the DMT curve 
corresponds to a threshold on b, such that it is possible to climb beyond a break point only if b is larger 
than the corresponding threshold. 

Now let M = M^ — i + 1, a = \Mr — Mt \ + 2i — 1 in Lemma [3?T] and in the limit of n ^ cxd, let kin 
be the number of lines with slopes h/n such that we have d„ = \Mr — Mt \ + 2i — 1. Using the limiting 
form of Eqn. (1721 ) we can find that 

, \Mr-Mt\+2i-l (M,-i + l\ 
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This gives us the proportion of the lines that climb up the pth segment of the DMT curve. In the general 
MIMO case, to be able to go up exactly to the pth line segment, we need to have J2^Zi kj < 1 < J2^=i ^j- 
This is equivalent to the requirement Cp_i < b < Cp in the theorem. 

To climb up each line segment, we need kin lines (layers) for i = I, . . . ,p — l, and for the last segment 
we have (1 — Yl^j=i kj)n lines, which gives us an extra ascent of 



iM,-p+l){\Mr- Mt\ + 2p-l){l-e 



\Mr-Mj.\+2p-l 



on the tradeoff curve. Hence the optimal distortion exponent, i.e., the total ascent on the DMT curve, 
depends on the bandwidth ratio and is given by Theorem 14.21 

Appendix IV 
Proof of Lemma|4~41 



As in the proof of Theorem 13.11 in Appendix I, we let Xi = SNR-°'^ and Xi = SNR-f^^ for i = 
1, . . . , M*. The probability densities of A and A and their exponents a and f3 are given in Appendix I. 
Note that since H is a submatrix of H, A and A as well as a and /3 are correlated. Let p{a, (3) be the 
joint probability density of o; and /3. If Mt < Mr, H and H coincide and A = A, a = /3. 

We can write 



(74) 



= [ SNR-^^-f^'"--^*p{a)da, (75) 

= [ 57Vi?-(^-^— )^ [ p{cx, (3)d(3dcx, (76) 

= [ SNR-^^-f^--'^^ [ p{cx,(3)dcxdf3, {11) 

< [ SNR-^^-^"^'-'^^p{(3)d(3, 



where 



(78) 

SNR-^", (79) 



fi = inf (1-/?^,,)+ + V(2i-1)A. (80) 

i=l 



The minimizing /3 satisfies /3i E [0, 1] and /32 = • • • = Pm, = 0, and we have fi = 1. 
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Appendix V 
Proof of Theorem [5TT] 

The mutual information between Yj^ and defined in ( [35] ) can be written as 

2^(Yfc;Xfc) = I(Yfe; Xfc, Xfc+i) — I(Yfc; Xfc+i|Xfc), (81) 



log det f I + ^^HHt") - log det f I + ^^^^HRt 

det fl + lp^HHt) 

log } _^ (82) 

det ( I + •g^^^+^ HHt 



Mt 

For layers k = 1, . . . ,n — 1, and the multiplexing gain vector r we have 



PL = Pri 



Pr < 



and 



det (l + ^HHt) 

^ ■ — t sm TV < SNR 

det (l + ^^jg;+^ HHtj 

H : 11^=1^^+^. -^-^ < SNR^^ } , (83) 

Pout = |h : n (l + ^A.) < 5iVi?- 1 . (84) 

where Ai < A2 < • • • < Xm, are the eigenvalues of HH^ (HtH) for Mt > Mr (Mt < Mr). Let 
Aj = SNR^°'\ Then for the power allocation in (|43] ). conditions in Eqn. (|83l ) and Eqn. ( [84l ) are, 
respectively, equivalent to 

M, 

^(1 - ri rfc_i - efc_i - ai)+ - ^(1 - n - - Qi)+ < r^, 

i=l i=l 

and 

- n r„_i - e^-i - ai)+ < r„. 

i=l 

Using Laplace's method and following the similar arguments as in the proof of Theorem 4 in [1] we 
show that, for /c = 1, . . . , n, 

= SNR-'''', (85) 

where 



inf J2 i\^t -Mr\ + 2i-l) a^. (86) 

For fc = 1, . . . , n — 1 
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^fc = I a = [ai, . . . , omJ G : ai > • • • > > 0, 

- n "^fc-i - efe-i - ttj)"^ - Z]f=i(l - n rk-ek- ai)~^ < rfcj. 

while 

i„ = { « = [ai, . . . , OAfJ G MA^*+ : ai > • • • > om. > 0, 

Y.i=l (1 - n ?^n-l - Cn-l - "i)^ < "^n}- 

The minimizing a for each layer can be explicitly found as 

ai = l-ri rfc„i - e^.i, for z = 1, . . . , M* - 1, 

and 

"M. = 1 - ri - • • ■ - Tfc-i - rfe - efc-i. 
Letting — > for A; = 1, . . . , n — 1, we have 

4 =M*M,(l-ri rfc_i)-(M* + M, -l)rfc. (87) 

Note that the constraint X]"^^ ^ 1 makes the sequence {1 — ri — • • • — r^}^^]^ decreasing and greater 
than zero. Thus constitutes an increasing sequence. Therefore, using ( [37] ) and ( [39l ). in the high SNR 
regime we have P^^^ = P^^f., and 4<i(rfc) = 4. 

Appendix VI 
Proof of Theorem [53] 

Using the formulation of A^'^ in (|4TI ) and successive decoding diversity gains of the proposed power 
allocation in Theorem 15.11 we find the multiplexing gain allocation that results in equal SNR exponents 
for all the terms in (|4TI ). 

We first consider the case b > {Mt — l){Mr — 1). Let 

i,-(M,-l)(M.-l) 



For < ryo < 1> we set 



If % ^ 1> we set 



MtMril - m) 
MtMr — brjQ 

ri = ??o~^n, for i = 2, ...,n. (90) 



ri = ■ ■ ■ = Vn = , ^ , ^ ^ff''. . ^ 7- (91) 



nMtMr + Mt + Mr - I 
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Next, we show that the above multiplexing gain assignment satisfies the constraint Ylll=i — 1- ^^^r 
r/o < 1, 6 < MtMr and 

ri + • • • + r„ = ri (1 + r/o + • • • + t/q"^) , 
M,M,(l-r?g) 
MtMr-br]'^ ' 
< 1. (92) 

On the other hand, when % > 1, we have Yri=i n = uM.iZ+mI+m^-i < 1- 
Then the corresponding distortion exponent can be found as 

^BS^l b V/M%f if {Mt-l){Mr-l)<b<MtMr, 

" ^ ^(AW if b>MtMr. 



nMtMr+Mt+M^-l 

For {Mt - k - l){Mr - k - 1) < b < {Mt - k){Mr - k), k = 1, . . . , - I, we can consider the 
{Mt — k) X (Mr — k) antenna system and following the same steps as above, we obtain a distortion 
exponent of 

(Mt - k){Mr - k)il - 7]l) 

{Mt-k){Mr-k)-br]l ' 

where 1]^ is defined in (l46l ). 

In the limit of infinite layers, it is possible to prove that this distortion exponent converges to the 
following. 



3 if 0<6<MtM„ 

A^^ = lim A^^ = { (94) 

MtMr if 6 > MtMr. 



n— >oo 



Appendix VII 
Proof of Theorem [6TT] 

We transmit codewords of each layer across all fading blocks, which means that P^t in Eqn. ([83 
becomes 



Pout = Pri 



1 det (l + ^HiHj'^ 

(Hi, . . . , Hi) : - V log )^ = V < rk log SNR 

det I + ^H.Ht^ 



for = 1, . . . , n — 1, and P^^t can be defined similarly. 
For the power allocation in (1531 ). above outage event is equivalent to 

L M, 

y^(l - Ln Lrk-i - Lek-1 - aj^i)~^ - (1 - Ln Lr^ - Let - aj^i)'^ < Lvk- 

j=i i=i 

Following the same steps as in the proof of Theorem 15.11 we have 

= 5iVi^-^^ (95) 
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where 

L M, 

dk = inf ^^(|Mt-M,|+2z-l)a,-,. (96) 



"e^^ j=i i=i 



For A: = 1, . . . , n — 1 



I Q = [ai, . . . , OLA/J e R^^*+ : aj^i > ■ ■ ■> aj^M, > for j = 1, . . . , L 

Et=i Ei^=i(l - - Lefc-i - Oj- i)+ 

-(1 - Lri Lrk - Lek - Oj,*)"*" < Lrk^. 

The minimizing a for each layer can be explicitly found as 

aj^i = I- Lri Lrk-\ - Lek-i, for j = 1, . . . , L, and i = 1, . . . , - 1, 

and 

aj,M, = I- Lri Lrk-i - - Lek-i, for j = 1, . . . , L. 

Then, letting e„_i 0, the diversity gain is obtained as 



dk = L 



M*M*(1 - Lri Lrk-i) - (M* + - l)rfc 



(97) 



We skip the rest of the proof as it closely resembles the proof of Theorem 15.31 in Appendix IVll 
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